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f-^ ; ABSTRACT 

' Topological texture features were compared in their ability to classify morphological patterns known as 'honey- 

combing' that are considered indicative for the presence of fibrotic interstitial lung diseases in high-resolution 
r^ \ computed tomography (HRCT) images. For 14 patients with known occurrence of honey-combing, a stack of 70 

axial, lung kernel reconstructed images were acquired from HRCT chest exams. A set of 241 regions of interest 
'^ I of both healthy and pathological (89) lung tissue were identified by an experienced radiologist. Texture features 

2< ' were extracted using six properties calculated from gray-level co-occurrence matrices (GLCM), Minkowski Di- 

mensions (MDs), and three Minkowski Functionals (MPs, e.g. MF.euler). A fc- nearest-neighbor (fc-NN) classifier 
lyj ■ and a Multilayer Radial Basis Functions Network (RBFN) were optimized in a 10-fold cross-validation for each 

O I texture vector, and the classification accuracy was calculated on independent test sets as a quantitative measure 

c/3 ■ of automated tissue characterization. A Wilcoxon signed-rank test was used to compare two accuracy distri- 

i^^l butions and the significance thresholds were adjusted for multiple comparisons by the Bonferroni correction. 

The best classification results were obtained by the MF features, which performed significantly better than all 
the standard GLCM and MD features {p < 0.005) for both classifiers. The highest accuracy was found for 
MF.euler (97.5%, 96.6%; for the /c-NN and RBFN classifier, respectively). The best standard texture features 
were the GLCM features 'homogeneity' (91.8%, 87.2%) and 'absolute value' (90.2%, 88.5%). The resuhs indicate 
that advanced topological texture features can provide superior classification performance in computer-assisted 
QQ I diagnosis of interstitial lung diseases when compared to standard texture analysis methods. 

I/-, . Keywords: texture analysis, quantitative image analysis, lung high resolution CT, supervised learning. 

in 

O ; 1. INTRODUCTION 

The rapid progress in computed tomography (CT) acquisitions of the chest have been followed by a sharp increase 
in research on computer analysis of thoracic CT scans and the development of computer-aided diagnosis systems.'^' 
In high-resolution CT (HRCT) images of the chest, the classification of pathological patterns of interstitial lung 
j^ ' diseases (ILDs) is a complicated classification task which requires substantial expertise. Such ILD disorders are 

J^ I characterized by specific abnormal findings with a distinct texture-like appearance. One morphological pattern 

related to the heterogenous group of ILDs is 'honeycombing'.'^i'^ 

HRCT images can provide an accurate assessment of lung tissue patterns that allow automated analysis tools 
to provide quick information to the radiologists (or even non-specialists). A reliable computer-aided diagnosis 
could improve the radiologists' efficiency and avoid surgical lung biopsies for some patients. Furthermore, the 
radiologist's ability to interpret HRCT depends on their expertise and might be subjective, whereas an automated 
classification of lung patterns is consistent and reproducible. 
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Figure 1. High-resolution computed tomography (HRCT) image of the lung) 




Figure 2. ROIs extracted from HRCT images with healthy lung tissue (left) and pathological lung tissue (right). 



A computer-aided diagnosis system should be used in a pre-screening step in order to label suspicious images 
and regions that include pathological patterns. This setting incorporates the valuable experience and judgement 
of the radiologist, improves the radiologist's productivity and reduces reading fatigue. One approach to design 
such an image-based, automated detection tool is to extract relevant texture features from regions of interest 
(ROI) and the use of supervised machine learning techniques to classify the ROI as healthy or pathological 
pattern. The focus of this paper is the question whether recently developed texture features provide a better 
classification of lung patterns. 

Standard measures like features extracted from gray-level occurrence matrices (GLCM)^ calculate second- 
order statistics of the gray-level distributions. Another popular approach to texture analysis is to assess global 
fractal dimensions, e.g. the Minkowski DimensiorP). Both methods have been widely used in the medical image 
analysis problem, e.g. in the analysis of radiographs^'^ Recently developed texture features like Minkowski 
Functionals (MF)2Hlil provide information about topology of the underlying gray-level pattern within the ROI 
and include higher-order statistics. Rather than representing the ROIs with statistical properties or fractal 
dimensions, the MP feature vectors represent the shape and the connectivity of the gray levels in the lung 
pattern. This concept has been successfully applied previously in the to assessment bone quality with various 
imaging modalities .l^ESlESl 

The purpose of this work is to compare a set of standard texture features with this new approach to texture 
analysis and to train two classifiers in order to detect healthy and pathological ROIs in lung HRCT images. 



2. DATA 

The used data set consists of chest scans of 14 patients with known occurrence of honeycombing patterns. 
For each patient, a stack of 70 axial images reconstructed with a lung kernel from high- resolution computed 
tomography (HRCT) of the chest was assessed by an experienced radiologist (Fig. [1]). The resulting images are 
comparable to classical HRCT with the exception that reconstructions of a small number of slice positions are 



also available. All images have a slice thickness of Imm, an in-plane pixel size of 0.69x0.69 mm, and a slice 
distance of 5 mm. In this data set, 241 square, 2-dimensional ROIs with an edge length of 21 pixels were defined 
in 152 healthy and 89 pathological pattern occurrences in lung tissue (Fig. [2]). In each of these ROIs, the set of 
the following feature vectors of texture descriptors were calculated. 

3. METHODS 

3.1 Texture Features 

Gray level co-occurrence matrices (GLCM/^E^l a,re second order histograms that estimate the joint probability 
of a gray level pixel gi and gray level pixel gj for a given pixel spacing and direction. From the GLCMs, the least 
correlated and those most frequently used statistical features were calculatecJlSl (see Appendix): absolute value 
(absval), entropy, contrast, energy, correlation (correl), and homogeneity (homogen).In each ROI, the gray levels 
were rescaled (min-max-scaling) to 256 gray level values and texture features were calculated for ndir — 4 offset 
directions 9 = {0°, 45°, 90°, 135°} and five pixel spacings Sp = {1, 2, 3, 4, 5}. Pixels pairs were excluded from the 
GLCM if the neighbor pixel was outside the ROI. 

Fractal properties were assessed with the global Minkowski Dimension (MD),'S1E1ES1 a robust estimator of 
fractal dimensions based on morphological operations. If a two-dimensional image / is regarded as a gray-level 
surface, a volume difference Vg{s) can be calculated between a dilated and eroded conversion of the the image 
/, i.e. (/ © sg) and (/ sg) respectively, for a given structuring element g and a given scale s: 

M N 

^.(^) = EE[(^®^5)-(/e.5)]. (1) 

m— n— 

The global MD can then be defined aP 

MD.lim«fM. (2) 

s->oo log(l/s) 

In practice, the MD features were assessed by plotting the numerator and the denominator in Equation ^ 
and finding the slope using a least-square fit. For the calculation, 6 structuring elements were used: 2 isotropic 
( a square and a disk) and 4 anisotropic elements pointing in four different directions (0°, 45°, 90°, 135°). 

Morphological image analysis characterizes binary images in terms of shape (geometry) and connectivity 
(topology) given by the Minkowski Functionals (MFs) HlEHli] \i can be shown that those morphological properties 
of the objects in the binary image are completely described by the corresponging MFs.l^l^ By defining a set 
of thresholds, the gray-level images of the lung pattern can be represented as a set of binary images. For each 
of those 2D binary images, three MF features can defined: area, perimeter (boundary length) and the Euler 
characteristic x (connectivity): 

MF.area = n^ (3) 

MF.perim = -Ans + 2ne (4) 

MF. euler — n^ — Ug + Uy (5) 

where rig is the total number of white pixels, n^ the number of edges, and riy the number of vertices. These 
three MFs are functions of the gray level threshold, which characterize the underlying topological features of the 
image texture. The gray level distributions in each ROI was divided into 100 quantilcs which were then used 
as thresholds to calculate the three MF features. The resulting three feature vectors for each ROI was used as 
input for the classification task. 
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/c-NN 






RBFN 






Accu. [%] 


Sens. [%] 


Spec. [%] 


Accu. [%] 


Sens. [%] 


Spec. [%] 


GLCM.absval 


90.2 ±3.4 


90.1 ±5.8 


90.1 ±5.5 


88.5 ±3.7 


96.0 ±2.9 


75.9 ±7.6 


GLCM. entropy 


85.2 ±3.6 


85.3 ±7.6 


85.1 ±4.8 


71.1 ±4.8 


69.6 ±6.9 


73.5 ±8.7 


GLCM. contrast 


82.8 ±4.4 


84.5 ±6.9 


82.0 ±6.4 


81.0 ±5.0 


87.9 ±6.8 


70.2 ±9.3 


GLCM.correl 


77.4 ±5.2 


77.0 ±9.1 


78.0 ±7.6 


80.5 ±5.0 


84.2 ±7.0 


73.9 ±9.7 


GLCM. energy 


86.2 ±4.0 


85.2 ±6.6 


86.8 ±4.8 


83.9 ±3.8 


83.0 ±5.6 


86.0 ±7.2 


GLCM.homogen 


91.8 ±2.9 


88.3 ±5.6 


94.0 ±3.5 


87.2 ±4.3 


87.5 ±5.6 


86.6 ±8.0 


MD 


86.7 ±4.2 


84.2 ±7.7 


88.4 ±4.8 


84.3 ±3.2 


84.7 ±5.3 


83.6 ±7.8 


MF.area 


96.6 ±2.0 


96.9 ±3.4 


96.5 ±2.5 


93.3 ±2.7 


93.4 ±3.7 


93.4 ±4.5 


MF.perim 


96.1 ±2.2 


94.8 ±4.2 


97.0 ±2.4 


95.3 ±1.9 


96.2 ±2.7 


93.9 ±4.1 


MF.euler 


97.5 ±1.4 


97.5 ±3.0 


97.5 ±1.8 


96.6 ±2.5 


96.5 ±3.2 


96.8 ±3.6 



Table 1. Accuracy, sensitivity, and specificity for each texture feature vector and eacli classifier. The means and the 
standard deviations of the 50 iterations are reported. 



3.2 Classification 

After the the calculation of the texture properties, the machine learning task consists of the classification between 
healthy and pathological lung patterns in a supervised learning step. Tvifo classifiers were used in this study, a 
fc-Nearest Neighbor (A:-NN) , which reflects learning by density estimation, and Multilayer Radial Basis Function 
Neural Network (RBFN), which is a nonlinear numerical approach for this classification task.D^ In order to 
identify unknown ROIs, models are created from known labeled data in the training phase. The goal is to find 
the optimal classifier parameters that model the best boundaries between the two lung classes for each texture 
feature. For the A:-NN classifier, the free parameter was the number of nearest neighbors k, for the RBFN 
classifier, the shape parameter of the Gaussian radial basis function. The performance of optimization was 
measured by the accuracy, i.e. the total number of correctly classified data points, as a quantitative measure of 
automated tissue characterization. 

In order to generalize the resulting performance, the model parameters were calculated with a training set 
of labeled ROIs and the accuracy was calculated on an independent test set of ROIs. In one iteration, the data 
set is divided into a training set (70%) and a test set (30%). The training set is used to determine the best 
classifier parameter in a 10-fold cross-validation. This classifier model is then applied on the independent test 
set to calculate the accuracy of one iteration. This procedure was repeated 50 times resulting in an accuracy 
distribution for each texture feature and classifier. A Wilcoxon signed-rank test was used to compare two 
accuracy distributions and the significance thresholds were adjusted for multiple comparisons by the Bonferroni 
correction. 

The RBF implementation was taken from the Neural Network Toolbox v6.0.1 in Matlab R2008b (TheMath- 
Works, Natick, MA); the texture analysis, the optimization and the statistical analysis were implemented in 
Matlab as well. 

4. RESULTS 

In Tab. O the measured accuracy, sensitivity, and specificity for each texture feature vector and both classifiers 
are presented. The best classification results were obtained by the MF features, which performed significantly 
better than the GLCM and MD features (p < 0.001) for both classifiers (Fig. ^. The highest accuracy of 
97.5% and 96.6% was found for the MF.euler feature for the /c-NN and RBFN classifier, respectively. The best 
standard texture features were GLCM.absval and GLCM.homogen, which had significantly higher accuracies 
than the other GLCM and MD features (p < 0.005 for fc-NN, p < 0.001 for RBFN). 



The superior classification performance of the MF features is also reflected in the corresponding high specificity 
and sensitivity values (Tab. [1]). Only GLCM.absval had a comparable sensitivity of 96.0% but a has small 
specificity (RBFN), and only GLCM.homogen had a high specificity of 94% but a small sensitivity (fc-NN). 

5. DISCUSSION 

In this study, recently developed texture features were used to classify a pathological pattern in lung HRCT 
images. The MF features extract topological information from the image patterns and had a significantly higher 
classification accuracy than established GLCM and MD texture features. The set up of this study allows an easy 
implementation of a clinical setting and the classification can reach a high accuracy. 

Instead of extracting one feature from the set of MFp feature vectors were created that represent the complete 
topological of the MF as function of the threshold. An additional new approach was to use those high-dimensional 
feature vectors to train the A:-NN and the RBFN classifiers in order to classify the lung patterns. This combination 
of topological texture analysis and pattern recognition techniques produced high classification accuracy that can 
be used in computer-aided diagnosis applications. Because the patches in this study were small, the analysis could 
be also used to segment the complete lung tissue. Each lung pixel can be classified based on the local surrounding 
tissue and the results of this classification can be visualized to the radiologists by marking pathological areas. The 
machine learning step of the presented analysis may even be improved by comparing more classifiers as different 
supervised learning techniques could also affect the performance of the texture features in the classification 
task.EHl 

In addition, a different binning method was applied to calculate the MF features by using quantiles instead 
of bins of constant widtiP to sample the MF properties as a function of the threshold. This approach guarantees 
that each bin is represented by a minimal number of gray levels and empty vector entries are avoided. This 
binning provides a more consistent and statistically stable representation of the MF features especially for small 
ROIs like used in this work, 

The superior classification performance of the MF features correspond with the recent results on chest HRCT 
studies.Sl In current research, MF features have also been successfully used to assess bone quality in magnetic 
resonance images ,2^ simulations ,'i^multidetector CT scans,'i2and radiographs.'^ The approach of the MF features 
to represent the shape and the connectivity of the lung pattern in a quantifiable manner seems to be well-suited 
for the pathological patterns produced by ILDs disorders. 

An advantage of this method is that it can be easily implemented and applied in a clinical setting where 
it can provide high classification performances in detecting those pathological patterns. It is our hope that 
this will improve the radiologists' ability to accurately classify ILDs. However, to complete the validation of 
clinical applications, further investigation is necessary. For completely automated differential diagnosis of ILDs, 
more information will be required to be incorporated into the computer system. In this study, a defined set 
of acquisition and reconstruction parameters was used to assess the results. In future studies, the infiuence 
of the these parameters on the accuracy of the texture features should be investigated. Due to the fact that 
classification is thin-sclice CT scans is a difficult task even for experts, more radiologists should be involved to 
define a reference standard for the machine learning step. 

Furthermore, a limiting factor of this study might be that only one typical pathological lung pattern, i.e. 
honeycombing, was classified. This represents only one of many interstitial lung disease patterns that are common 
in clinical chest CT. In addition, a final objective is not only to classify between healthy and diseased tissue, 
but to differentiate between various pathological lung patterns.tlS The influence of those different pathological 
patterns on the accuracy of the texture features will be investigated in future studies. A further improvement of 
this approach is also the analysis of three dimensional volumes instead of two dimensional areas. 

6. CONCLUSION 

In conclusion, the presented results indicate that advanced topological texture features can provide superior clas- 
sification performance in computer-assisted diagnosis of interstitial lung diseases compared to standard texture 
methods. However, future studies are necessary to validate these results in a clinical setting and to provide a 
reliable differential diagnosis. 
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Figure 3. Accuracies of the texture features using the fc-NN (top) and the RBFN (bottom) classifier. In each box, the 
central mark corresponds with the median, the edges of the box are the 25th and 75th percentile, and the error bars 
extend to the most extreme data points not considering outliers which are plotted individually (crosses). 



7. APPENDIX 

For a gray-level occurrence matrix P at a given pixel distance Sp and offset angle 9, 

P{i,j) = P{i,j,Sp,0), (6) 

the following definitions for the GLCM features were used: 

N N 

Absolute Value = \J 2_. I* ^ J\PihJ) 
i=\ j=i 

N N 

Entropy = ^Yl ■^(*' ■^) ^°S2 P{h j) 
t=i j=i 

N N 

Contrast = JIII^* " J')'^(*' j) 

N N 

Energy = EEt^^*'^')]' 

N N 

Correlation = y^y^[(» - fJ.x){j - Hy)P{i,j)]/{<Tx<7y) 
i=i j=i 

N N 

Homogeneity = ^ ^ P(«,i)/(1 + |« - j|) 
i=i j=i 

where fXx and a^ are the mean and the standard deviation of the rows in the GLCM, respectively, and /ij, and 
ay are the corresponding values for the columns. 
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